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Abstract 

Global Value Numbering(GVN) is a method for detecting redundant 
computations in programs. Here, we introduce the problem of Global 
Value Numbering in its original form, as conceived by Kildall(1973), and 
present an algorithm which is a simpler variant of Kildall's. The algorithm 
uses the concept of value expression - an abstraction of a set of expres- 
sions - enabling a representation of the equivalence information which is 
compact and simple to manipulate. 



1 Introduction 

Detection and elimination of redundant computations have been interesting top- 
ics in the area of code optimization in compilers. Value Numbering originated 
as a method for detecting redundant computations within a basic block (known 
as Local Value Numbering). The basic idea is to assign a number to each ex- 
pression in such a way that equivalent expressions are assigned the same number 
[5] . Two expressions are said to be equivalent if we can statically determine that 
both the expressions will have the same value during execution. 

The problem of Global Value Numbering(GVN) is how to extend the idea 
of local value numbering to detect redundant computations globally, within 
a program. An initial attempt on GVN can be found in Kildall [4]. In the 
structuring approach of Kildall , for every expression in the program, Kildall 
computes and maintains all its equivalent expressions, leading to exponential 
sized partitions. Most of the works that followed, tried to make the process 
more efficient by means of special program representations and data structures 
[H [31 E H] ■ In general, we feel that there is a lack of clarity in the underlying 
concept of GVN and a lack of simplicity in the solutions. 

In fact, in his implementation notes, Kildall suggested a value numbering 
approach to ensure linear sized partitions [4] . An expression with value numbers 
as operands, called value expression, is used to represent a set of expressions 
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that are equivalent. At program points where multiple control flow paths merge, 
the equivalence information is computed by means of a confluence operator. 
Confluence of equivalence classes involving value expressions is a little bit tricky, 
and may be due to this reason the method did not draw much attention. Here, 
we make use of the concept of value expression to devise a simple algorithm for 
GVN. 

We start with notations and definitions in Section 2. The algorithm for global 
value numbering is given in Section 3. This is followed by some comments on 
the algorithm in Section 4 and conclusions in Section 5. 

2 Notations and Definitions 

2.1 Program Representation 

The input to the algorithm is assumed to be a flow graph, with an empty entry 
node, denoted entry, and an exit node, denoted exit. Each node contains at most 
one assignment statement in three-address cod^. Each assignment statement 
is of the form a; = e, where x is a variable and e is an expression. An expression 
is either a constant, a variable, or an expression of the form y op z where y and 
z are variables or constants and op is an operator. For a node n in the flow 
graph, the input and output points of the node are denoted by INn and OUTn 
respectively. 

2.2 Expression-pool 

For a program point, the expression-pool at that point denotes the set of ex- 
pressions that are equivalent at that point. This is represented as a partition of 
expressions into equivalence classes. Each class in the pool has a value number, 
denoted Vi, where i is a positive integer. For convenience, we show the value 
nwrnfoer of a class as the first element in it. As an example, {[fi, a, a:], [v2, b, y]} 
shows an expression-pool with two equivalence classes. The value number vi is 
assigned to a and x and value number V2 is assigned to b and y. For a node n 
in the flow graph, we use EINn and EOUTn to denote the expression-pools at 
INn and OUTn respectively. 

2.3 Value Expression 

For each expression of the form x op y, we can obtain a value expression, by 
replacing the operands of the original expression by the corresponding value 
numbers. For example, suppose the expression-pool {[wi, a, x], [v2, b, y]} 
reaches a node containing the statement z = x -\- y. Here, the value expression 
of a: -|- ?/ is fi + f2. Instead of the program expression a; -I- y, we put its value 
expression in the pool, with a new value number say W3, to obtain the output 
pool, {[vi, a, x], [v2, b, y], [vs, vi-i-V2, z]}. 

^For simplicity, we do not consider other statements. 
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We can see that the value expression V1+V2, represents not just x+y, but the 
set of equivalent expressions {a+6, x+b, a+y, x+y}. The presence of +z)2 in 
the pool indicates that an expression from this set is already computed, and this 
information is enough for detection of redundant computations. The interesting 
point to note is that a single binary value expression can represent equivalence 
among any number of expressions of any length. As an example, with respect 
to the above pool, the value expression vi +V3 represents the expressions a + z, 
X + z, a + {a + b), a + (x + b), X + {a + 6), and so on. 

3 Value Numbering: Algorithm 

The algorithm computes the expression-pool at every program point. For a 
node n in the flow graph, if EINn is given, then we can compute EOUTn by 
means of the transfer function associated with node n. The points at which 
multiple control flow paths join are called confluence points. For computing the 
expression-pools at such points, we define a confluence operator. 

3.1 Transfer Function 

The transfer function associated with a node n, denoted /„, computes EOUTn 
using EINn, i-e. EOUTn = fn{EINn). Algorithm 1 shows the transfer function 
for a node n containing an assignment x ~ e, where a; is a program variable and 
e is an expressioiH. The assignment can be considered as killing all expressions 
involving the variable x and generating the new equivalence between x and e. 
The effect of killing expressions is achieved by removing x from its class, say Ci. 
Now, if Ci is a singleton with its value number, say Vi, as the only element in 
it, we delete the class Ci and any value expressions involving Vi from the pool. 
The function deleteSingletons{E) is assumed to do these steps repeatedly till 
no singleton classes remain in the pool E. 

The function valueExp{e) returns the value expression of e, if e is of the form 
X op y, and returns e itself otherwise. If e is already assigned a value number in 
EINn, say Ve, then we put x in the same class as that of e. Otherwise, a new 
class containing x and e is created together with a distinct value number in it 
and this new class is added to the output pool (if e contains an operator, then 
instead of e, we add its value expression). 

3.2 Confluence Operator 

The expression-pool at a confluence point should contain the sets of equivalent 
expressions common to all incoming pools. The common expressions that are 
explicitly present in the input pools can be obtained by a simple class-wise 
intersection of expression-pools. The hard part is obtaining the equivalence in- 
formation based on the value expressions in the incoming pools. In the example 

■^if X occurs in e, we assume that the statement is split into t = e followed hy x = t where 
t is a new distinct temporary variable. 
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Algorithm 1: Computes EOUTn = fn{EINn), for a node n containing 

the assignment x = e. 

Et = EIN^; 

if {x is in a class G Et) 

then remove x from Cx', 

deleteSingletons{Et)\ 
e' = valueExp{e); 
if (c' is in a class Ce' € Et) 

then add x to Cg' ; 

else create a new class Ck, with x and e' together with a new 
value number Vk in it, and add Ck to Et; 
EOUTr, = Et; 
return EOUT^; 



Bi: {[vi, X, a], lv2, y, b], [vi, V1+V2, z]} B2: {[vt, x, c], [vs, y, d], [ve, vt + vs, s]} 




E3: {[vt, x], [vs, y], [tig, VT + Vs]} 



Figure 1: Computing confluence 

shown in Figure 1, we see two expression-pools, Ei and E2, reaching a conflu- 
ence point. Let E3 be the pool resulting after confluence. Since x occurs in 
both the input pools Ei and £"2, wc put it in the output pool E3. Since the 
value numbers of x are different in the input pools, we assign a new distinct 
value number vr for the resulting class. Similarly, we put y in Es with new value 
number . In Ei, the value expression f 1 + V2 represents the set of equivalent 
expressions x + y, x + b, a + y, and a + b. In E2, the value expression V4, + V5 
represents the set of equivalent expressions x + y, x + d, c + y, and c + d. Here, 
we see a common expression x + y represented by vi + f 2 in Ei and v^ + vc^ in E2 ■ 
Let us now devise a method to collect such common expressions by examining 
the value expressions in the input pools. 

Consider the corresponding operands of the pair of value expressions vi + V2 
and Vi+v^. We see a common element x in the classes of vi and V4, and a common 
element y in the classes of V2 and U5. In other words, the intersection of the 
classes of vi and W4 is non empty and also the intersection of the classes of V2 
and U5 is non empty. This is enough to infer that there is a common expression 
represented by the two value expressions. At confluence, the intersection of the 
classes of vi and results in a class with value number and the intersection 
of the classes of V2 and results in a class with value number ug. Hence the 
common expression x + y, after confluence, gets the value expression v-j + vs 
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and this can be added to E^. 

In general, let there be a class with value number Vi and value expression 
Vii+Vi2 in El, and let there be a class with value number vj and value expression 
Vji + Vj2 in E2. If the intersection of the classes of vn and Vji results in a non 
empty class with value number Vki, and the intersection of the classes of Vi2 and 
Vj2 results in a non empty class with value number Vk2, then we can conclude 
that the pair of value expressions, vn + Vi2 and Vji + Vj2, represent a common 
expression e in the two pools. The value expression of e after confluence is 
Vki + Vk2 and this can be added to E3. 

An algorithm for computing the confluence of two expression-pools Ei and 
Ej is given in Algorithm 2. We use the symbol /\ to denote the confluence 
operation. The algorithm takes each pair of classes, Ci S Ei and Cj G Ej, 
and finds the common expressions in Ci and Cj (cither explicitly present or 
implicitly represented by value expressions). The operation of finding the 



Algorithm 2: Computing confluence of expression-pools, Ei and Ej, i.e. 

E^^Ej. 

foreach pair of classes, C, e Ei and Cj G Ej 
Ck = Ci n Cj-, 
if {Ck ^ 

then add Ck to Ek; 

deleteSingletons{Ek); 
return Ek; 



common expressions in Cj and Cj can be considered as a special intersection 
and we denote it as Ci n Cj . Algorithm 3 shows the computation of Cj n Cj . 

Algorithm 3: Computing Ci □ Cj. 

Note: A class with value number Vn is denoted by C„ and vice-versa. 
Cfe 

foreach e G Cj fl Cj 

add e to Ck; 
if (Cj and Cj have different value expressions) 

then 

// let Vii + Vi2 and Vji + Vj2 be the value expressions 
II in Cj and Cj respectively 

Cfei = Cji n Cji; 
Ck2 — Ci2 n Cj2; 
if (Cfci ^ $ and Ck2 + $) 

then add the value expression Vki + Vk2 to Ck; 
if (Cfc ^ $ and Cfc does not have a value number) 

then add a new value number, say Vk, to Ck; 
return Cfc; 
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3.3 The Algorithm 



Algorithm 4 shows the main function for Global Value Numbering. T is the 
top element such that /\T = Ei, for any expression-pool Ei. For a node n, 



Algorithm 4: Computes EINn and EOUTn for each node n. 

EOUTentrv - 

foreach node n ^ entry do EOUTn = T; 
while (changes to any EOUT occur) do 

//We mean changes in the equivalence information. 

// The chcinges only in value numbers Ccin be ignored. 

foreach node n ^ entry do 
EINn = A EOUTp; 

pGpred(n) 

EOUTn = fn{EINn); 



pred{n) denotes the set of immediate predecessors of n, and /\ computes 

pEpred{n) 

the confluence of expression-pools that reach the output of its predecessors. 

4 Comments on the Algorithm 

Power of the Algorithm Figure 2 shows an example of redundancy detec- 
tion which will demonstrate the power of the algorithm, especially that of the 
confluence operation. Let us use Cj to denote a class with value number Vi. The 

c = a + b d = p + q 

e = c + z f = d-{- z 

Bi: {[vi, X, a], [V2, V, b\, [V3, V1+V2, c], B2: {[v^, p, x], [«r, g, y], [vi, z], 

[vi, z], [va, vz + Vi, e]} [va, v^ + vt, d\, [og, vs + va,, /]} 




-E3: {[VW, x}, [Vu, y], [t)4, Z],[V12, Oio+fll], [Vl3, Vl2+Vl]] 

g = + y 
h = g + z 

Figure 2: Value Numbering to detect redundancy 

value expression v\ + V2 in C3 and vq + vr in Cg represent a common expression 

x + y. When we compute the confluence, C^FlCs results in the class C12 and the 
value expression Vio + Vn in it represents x + y. Another common expression is 
{x + y) + z, represented by the value expressions in C5 and Cg. The operation 
C5 n Cg, results in C13, whose value expression V12 + W4 represents (x + y) + z. 
After confluence, when we do value numbering of g = x + y, since x + y maps to 
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"^10 + vii, a value expression in E^, it is detected as redundant and g gets value 
number vi2. Similarly, the expression g + z maps to vi2 + V4 and hence this is 
also detected as redundant. 

A Comparison With Some of the GVN Algorithms In terms of power, 
our algorithm is as precise as Kildall's approach [4 . Alpern, Wegman, and 
Zadeck's (AWZ) algorithm [T] is an efficient algorithm for GVN, but is not 
as precise as Kildall's. The AWZ algorithm fails to detect the category of equiv- 
alences shown in Figure 2. The algorithm given by Gulwani and Necula 3 does 
intersection of only those classes having at least one common variable. But as 
per our observation, intersection of all pairs of classes is required for detecting 
the kind of equivalences similar to that shown in Figure 2. 

5 Conclusion 

An algorithm for Global Value Numbering is presented. The concept of value 
expression enables a compact representation of equivalence and simplifies the 
computation of confluence. It may be noted that a single binary value expression 
can represent equivalence among any number of expressions of any length. We 
feel the algorithm is simpler compared to that available in the literature. In 
terms of power, it is as precise as Kildall's approach. 
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